An information-theoretic perspective of tf–idf measures
https://www.sciencedirect.com/science/article/abs/pii/S0306457302000213

A Mathematical Theory of Communication by Claude Shannon
https://people.math.harvard.edu/~ctm/home/text/others/shannon/entropy/entropy.pdf

TextRank: Bringing Order into Texts
https://web.eecs.umich.edu/~mihalcea/papers/mihalcea.emnlp04.pdf

Variations of the Similarity Function of TextRank for Automated Summarization
https://arxiv.org/abs/1602.03606

Generic Text Summarization Using Relevance Measure and Latent Semantic Analysis
https://www.cs.bham.ac.uk/~pxt/IDA/text_summary.pdf

Using Latent Semantic Analysis in Text Summarization and Summary Evaluation
http://textmining.zcu.cz/publications/isim.pdf

Spam Filtering with Naive Bayes – Which Naive Bayes?
http://www2.aueb.gr/users/ion/docs/ceas2006_paper.pdf

Sentiment analysis using multinomial logistic regression
https://ieeexplore.ieee.org/document/8226700

Latent Dirichlet Allocation
https://www.jmlr.org/papers/volume3/blei03a/blei03a.pdf

List of Hugging Face Pipelines for NLP
https://lazyprogrammer.me/list-of-hugging-face-pipelines-for-nlp/

Indexing by Latent Semantic Analysis (Latent Semantic Indexing)
http://lsa.colorado.edu/papers/JASIS.lsi.90.pdf

Efficient Estimation of Word Representations in Vector Space (word2vec)
https://arxiv.org/abs/1301.3781

GloVe: Global Vectors for Word Representation (GloVe)
https://nlp.stanford.edu/pubs/glove.pdf

Deep Learning with TensorFlow, a bit more in-depth
https://deeplearningcourses.com/c/deep-learning-tensorflow-2